better duplicate key stats during index generation #30829

jeffwashington · 2023-03-21T16:41:42Z

Problem

At startup, scan storages to populate index. We can easily identify pubkeys that are in multiple slots (duplicates).
There are metrics on this, but they have misleading names.
More importantly, the duplicate list needs to include the first slot we encounter that contains a given duplicate pubkey so that clean will pick it up correctly.

Summary of Changes

Rename and add metrics.
When we find the first duplicate, also mark as duplicate the first item that was already added which has now become a duplicate.

Fixes #

brooksprumo

lgtm

codecov · 2023-03-21T18:59:42Z

Codecov Report

Merging #30829 (1c0d6d0) into master (ce0e23f) will decrease coverage by 0.1%.
The diff coverage is 100.0%.

@@            Coverage Diff            @@
##           master   #30829     +/-   ##
=========================================
- Coverage    81.3%    81.3%   -0.1%     
=========================================
  Files         726      726             
  Lines      203495   203509     +14     
=========================================
+ Hits       165608   165616      +8     
- Misses      37887    37893      +6

jeffwashington requested a review from brooksprumo March 21, 2023 16:44

better duplicate key stats during index generation

1c0d6d0

jeffwashington force-pushed the mm18 branch from 0bcad86 to 1c0d6d0 Compare March 21, 2023 16:49

brooksprumo approved these changes Mar 21, 2023

View reviewed changes

jeffwashington merged commit 2216647 into solana-labs:master Mar 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

better duplicate key stats during index generation #30829

better duplicate key stats during index generation #30829

jeffwashington commented Mar 21, 2023

brooksprumo left a comment

codecov bot commented Mar 21, 2023

better duplicate key stats during index generation #30829

better duplicate key stats during index generation #30829

Conversation

jeffwashington commented Mar 21, 2023

Problem

Summary of Changes

brooksprumo left a comment

Choose a reason for hiding this comment

codecov bot commented Mar 21, 2023

Codecov Report